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Abstract 

We show improved approximation guarantees for the traveling sales¬ 
man problem on cubic bipartite graphs and cubic graphs. For cubic 
bipartite graphs with n nodes, we improve on recent results of Karp 
and Ravi by giving a “local improvement” algorithm that finds a tour 
of length at most 5/4n — 2. For 2-connected cubic graphs, we show 
that the techniques of Momke and Svensson can be combined with the 
techniques of Correa, Larre and Soto, to obtain a tour of length at 
most (4/3 — l/8754)n. 


1 Introduction 

The traveling salesman problem (TSP) is one of the most famous and widely 
studied combinatorial optimization problems. Given a set of cities and pair¬ 
wise distances, the goal is to find a tour of minimum length that visits every 
city exactly once. Even if we require the distances to form a metric, the 
problem remains NP-hard. The classic Christofides’ algorithm [5] finds a 
tour that has length at most | times the length of the optimal tour. Despite 
much effort in the 35 years following Christofides’s result, we do not know 
any algorithms that improve on this guarantee. 

One approach that has often been useful in designing approximation 
algorithms is the use of linear programming. In this context, a major open 
question is to determine the integrality gap of the subtour elimination linear 
program or Held-Karp relaxation OE]; the integrality gap is the worst-case 
ratio of the length of an optimal tour to the optimal value of the relaxation. 
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tion (WISE) and by a grant from the Simons Foundation (#359525, Anke Van Zuylen). 


1 



Examples are known in which the length of the optimal tour is | times the 
value of the Held-Karp relaxation, and a major open question is whether 
this is tight. 

Recent years have seen some exciting progress towards answering this 
question on graph metrics, also called the graph-TSP. In this special case of 
the metric TSP, we are given an unweighted graph G = (E, E) in which the 
nodes represent the cities, and the distance between two cities is equal to the 
shortest path in G between the corresponding nodes. Examples are known 
in which the ratio between the length of the optimal tour and the Held-Karp 
relaxation is |, where the examples are in fact graph-TSP instances with 
an underlying graph G that is 2-connected and subcubic (every node has 
degree at most three). 

The graph-TSP thus captures many of the obstacles that have prevented 
us from obtaining improved approximations for general metrics, and much 
recent research has focused on finding improved algorithms for the graph- 
TSP. The first improvement for graph-TSP metrics is due to Gamarnik, 
Lewenstein and Sviridenko [8] , who show an approximation guarantee strictly 
less than | for cubic, 3-connected graphs. Aggarwal, Garg and Gupta [T] 
give a |-approximation algorithm for this case. Boyd, Sitters, Van der Ster, 
and Stougie show that there is a |-approximation algorithm for any cu¬ 
bic graph, and Momke and Svensson m show this holds also for subcubic 
graphs. 

Momke and Svensson also show a 1.461-approximation algorithm if we 
make no assumptions on the underlying graph G. Mucha m improves their 
analysis to show an approximation guarantee of Sebd and Vygen m 
combine the techniques of Momke and Svensson with a clever use of ear 
decompositions, which yields an approximation ratio of 1.4. 

As mentioned previously, for subcubic graphs, examples exist that show 
that we cannot obtain better approximation guarantees than | unless we 
use a stronger lower bound on the optimum than the Held-Karp relaxation 
or subtour elimination linear program. Correa, Larre and Soto [6] show that 
this is not the case for cubic graphs. They rehne the techniques of Boyd 
et al. [3] and show how to find a tour of length at most (| — glise) ^ 
the graph-TSP on a 2-connected cubic graph G, where n is the number of 
nodes. Correa, Larre and Soto also consider the graph-TSP on planar cubic 
bipartite 3-connected graphs, and give a (| — ^)-approximation algorithm. 
Planar cubic bipartite 3-connected graphs are known as Barnette graphs, 
and a long-standing conjecture in graph theory by Barnette [2] states that 
all planar cubic bipartite 3-connected graphs are Hamiltonian. Recently, 
Karp and Ravi [10] gave a ^-approximation algorithm for the graph-TSP on 
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a superset of Barnette graphs, cubic bipartite graphs. 

In this paper, we give two results that improve on the results for cubic 
graph-TSP. For the graph-TSP on (non-bipartite) cubic graphs, we show 
that the techniques of Momke and Svensson mi can be combined with 
those of Correa et al. to find an approximation algorithm with guaran¬ 
tee (| — g^). We note that independent of our work, Candrakova and 
Lukot’ka [Ij showed very recently, using different techniques, how to ob¬ 
tain a 1.3-approximation algorithm for the graph-TSP on cubic graphs. For 
connected bipartite cubic graphs, we give an algorithm that finds a tour of 
length at most |n — 2, where n is the number of nodes. The idea behind 
our algorithm is the same as that of many previous papers, namely to find 
a cycle cover (or 2-factor) of the graph with a small number of cycles. Our 
algorithm is basically a simple “local improvement” algorithm. The key idea 
for the analysis is to assign the size of each cycle to the nodes contained in it 
in a clever way; this allows us to give a very simple proof that the algorithm 
returns a 2-factor with at most n/8 components. We also give an example 
that shows that the analysis is tight, even if we relax a certain condition in 
the algorithm that restricts the cases when we allow the algorithm to move 
to a new solution. 

The remainder of this paper is organized as follows. In Section [2l we 
describe and analyze our algorithm for the graph-TSP on cubic bipartite 
graphs, and in Section [3l we give our improved result for non-bipartite cubic 
graphs. 

2 The Graph-TSP on Cubic Bipartite Graphs 

In the graph-TSP, we are given a graph G = {V,E), and for any u,v G V, 
we let the distance between u and v be the number of edges in the shortest 
path between u and v in G. The goal is to find a tour of the nodes in V that 
has minimum total length. A 2-factor of G is a subset of edges FEE, such 
that each node in V is incident to exactly two edges in F. Note that if F 
is a 2-factor, then each (connected) component of {V, F) is a simple cycle. 
If C is a component of {V,F), then we will use V{G) to denote the nodes 
in C and E{C) to denote the edges in G. The size of a cycle G is dehned 
to be \E{C)\ (which is of course equal to |I/(C)|). Sometimes, we consider 
a component of {V, F \ E') for some E' E F. A component of such a graph 
is either a cycle C or a path P. We define the length of a path P to be the 
number of edges in P. 

The main idea behind our algorithm for the graph-TSP in cubic bipartite 
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graphs (and behind many algorithms for variants of the graph-TSP given 
in the literature) is to find a 2-factor F in G such that {V, F) has a a 
small number of cycles, say k. We can then contract each cycle of the 2- 
factor, find a spanning tree on the contracted graph, and add two copies 
of the corresponding edges to the 2-factor. This yields a spanning Eulerian 
(multi)graph containing n -|- 2{k — 1) edges. By finding a Eulerian walk in 
this graph and shortcutting, we get a tour of length at most n -|- 2A: — 2. In 
order to get a good algorithm for the graph-TSP, we thus need to show how 
to find a 2-factor with few cycles, or, equivalently, for which the average size 
of the cycles is large. 

In Section 12.21 we give an algorithm for which we prove in Lemma [5] 
that, given a cubic bipartite graph G = {V,E), it returns a 2-factor with 
average cycle size at least 8. By the arguments given above, this implies the 
following result. 

Theorem 1. There exists a ^-approximation algorithm for the graph-TSP 
on cubic bipartite graphs. 

Before we give the ideas behind our algorithm and its analysis in Sec¬ 
tion EH we begin with the observation that we may assume without loss 
of generality that the graph has no “potential 4-cycles”: a set of 4 nodes S 
will be called a potential 4-cycle if there exists a 2-factor in G that contains 
a cycle with node set exactly S. The fact that we can modify the graph so 
that G has no potential 4-cycles was also used by Karp and Ravi [TO] . 

Lemma 1. To show that every simple cubic bipartite graph G = {V,E) 
has a 2 -factor with at most |K|/8 components, it suffices to show that every 
simple cubic bipartite graph G' = {V',E') with no potential f-cycles has a 
2 -factor with at most |KV8 components. 

Proof. We show how to contract a potential 4-cycle S' in G to get a simple 
cubic bipartite graph G' with fewer nodes than the original graph, and how, 
given a 2-factor with average component size 8 in G', we can uncontract S 
to get a 2-factor in G without increasing the number of components. 

Let S = {vi,V 2 ,V 3 ,Vi} be a potential 4-cycle in G, i.e., E[S] contains 4 
edges, say {ui, U 2 }, { 1 ^ 2 , ^^ 3 }) {^^ 3 ; 1 ^ 4 }, {^’ 1 ,^ 4 }, and there exists no node U 5 0 S 
that is incident to two nodes in { 01 , 02 , 03 , 04 } (since in that case a 2-factor 
containing a cycle with node set S would have 05 as an isolated node, and 
this cannot be a 2 -factor since U 5 must have degree 2 in a 2 -factor). 

We contract S, by identifying 01,03 to a new node Uoddj and identifying 
02,04 to a new node Ueven- We keep a single copy of the edge {uodd, ^even}- 
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The new graph G' is simple, cubic and bipartite, and \V'\ = \V\ — 2. See 
Figured] for an illustration. 



Figure 1: The 4-cycle on the left is contracted, by identifying ui, U 3 to a new 
node Uodd, and identifying ^ 2,^4 to a new node Uevenj and keeping a single 
copy of the edge {uodd, i^even}, to obtain the simple cubic bipartite graph on 
the right. 

Given any 2 -factor in G', we can “uncontract” S and find a 2 -factor in G 
with at most as many components as the 2 -factor in G': If the 2 -factor on G' 
does not contain {uodd, ^even} then it must contain the other 4 edges incident 
to Uodd and Ueven- When uncontracting S, this gives one edge incident to each 
Vi,i = 1 ,..., 4. Since all other node degrees are even, the graph consists of 
even cycles and two paths with endpoints in {ui,U 2 ,U 3 ,U 4 }. We can choose 
to add the edges or the edges {^ 2 )^ 4 }; both of 

these choices give a 2 -factor, and at least one of the two options must give 
a 2 -factor in which all 4 nodes in S are in the same cycle. If the 2 -factor on 
G' does contain {uodd; ^^even} then it must contain one other edge incident to 
Uodd and to Ueven- When uncontracting S, this gives one edge incident to vi 
or U 3 , and one edge incident to V 2 or U 4 . Suppose without loss of generality 
the edges are incident to vi and V 2 - Then, we add edges { 1 ^ 2 , 1 ^ 3 }, {'^ 3 ; ^^ 4 }, 
and {ui, U 4 } to get a 2-factor. Note that it is again the case that all 4 nodes 
in S are in the same cycle in the resulting 2 -factor. □ 

2.1 A Local Improvement Heuristic 

A cubic bipartite graph has a perfect matching (in fact, it is the case that the 
edge set can be decomposed into three perfect matchings), and given a cubic 
bipartite graph G = {V,E), we can obtain a 2-factor F by simply finding a 
perfect matching M and letting F = F\M. Conversely, if T is a 2-factor for 
G, then F\F is a perfect matching in G. Now, given an arbitrary 2-factor 
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Fi, we can use these observations to bnild a second 2 -factor F 2 snch that 
most nodes that are in a small cycle in {V, Fi) are in a long cycle in {V, F 2 ): 
The 2-factor F 2 is constructed by taking the matching E\Fi and adding half 
of the edges from each cycle in {V, Fi). Since each cycle is even, its edges can 
be decomposed into two perfect matchings, and we may choose either one 
of them to add to F 2 . We will say 2-factor F 2 is locally optimal with respect 
to Fi if F 2 contains all edges in E \ Fi and for each cycle C in {V,Fi), 
replacing F 2 by F 2 AE{C) does not reduce the number of components of 
(y, T 2 ) (where A denotes the symmetric difference operator). 

The essence of our algorithm is to start with an arbitrary 2-factor Fi, 
find a 2 -factor F 2 that is locally optimal with respect to Fi, and return the 
2 -factor among Fi,F 2 with the smallest number of components. 

If we consider a 6 -cycle C in (I/, Ti), and a 2-factor F 2 that is locally 
optimal with respect to Fi, then it is not hard to see that at least two edges 
of C will be part of the same cycle, say D, in (I/, T 2 ). Moreover, the fact 
that the graph G has no potential 4-cycles can be shown to imply that D 
has size at least 10. This observation motivates the condition in Lemma [2] 
below that for any C in (I/, Ti), there should exist D in (14, T 2 ) of size at 
least 10, such that \V{C) fl V{D)\ > 4. 

In Lemma [2] we show that this condition suffices to guarantee that either 
(14, Fl) or (14, F 2 ) has at most |I4|/8 cycles. In Lemma[3]we show the condi¬ 
tion holds for F 2 that is locally optimal for Fi, provided that all cycles in Fi 
are chordless: An edge {x,y} is a chord for cycle C in (14, Fi), if x,?/ G C, 
and {x,y} € F\Fi; a cycle will be referred to as chorded if it has at least 
one chord, and chordless otherwise. 

A few more details are needed to deal with the general case when Fi is 
not necessarily chordless; these are postponed to Section Y2?2\ 

Lemma 2. Let G = (14, F) he a simple cubic bipartite graph that has no 
potential 4-cycles, let Fi and F 2 be 2-factors in G, such that for any cy¬ 
cle C in (14, Fl), there exists a cycle D in (14, F 2 ) of size at least 10 such 
that |I4(C') n I4(F)| > 4. Then either (14, Fi) or (V^, F 2 ) has at most |I4|/8 
components. 

Proof. Let Ki be the number of components of {V,Fi) for i = 1,2. Note 
that it snfRces to show that 'jKi -|- (1 — 7 ) 4^2 < |F |/8 for some 0 < 7 < 1 . 

In order to do this, we introduce a value a{v) for each node. This valne 
is set based on the size of the cycle containing v in the second 2 -factor, and 
they will satisfy '^(^) ~ ^ every cycle D in the second 2-factor 

(14, F 2 ). Hence, we have that equal to the number of cycles in 
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(y, F 2 ). We will then show that the condition of the lemma guarantees that 
for a cycle C in {V,Fi), 


v&C 

This suffices to prove what we want: we have K 2 = YIvgV '^(^) — 
which is the same as jKi + jK 2 < 

The basic idea to setting the a-values is that if v is in a cycle D in iV, F 2 ) 
of size k, then we have a{v) = p The only exception to this rule is when D 
has size 10; in this case we set a{v) for u € -D to either g or There will 
be exactly 8 nodes with a{v) = ^ and 2 nodes with a{v) = g. The nodes 
u in H with a{v) = ^ are chosen in such a way that, if there is a cycle C 
in {V,Fi) containing at least 4 nodes in D, then at least 4 of the nodes in 
V{C) n V{D) will have a{v) = It is possible to achieve this, since the 
fact that D has 10 nodes implies that (V, Fi) can contain at most two cycles 
that intersect 44 in 4 or more nodes. 

It is easy to see that Q holds: by the condition in the lemma, any cycle 
C contains at least 4 nodes v such that a{v) < Since we assumed in 
addition that G has no potential 4-cycles, we also know that a{v) < g for all 

other u G V{C). Hence ^ iC)\-4)+4-^ = i|H(C')|-i. □ 

By Lemma [21 it is enough to hnd a 2-factor F 2 that satishes that every 
cycle in the first 2-factor, Fi, has at least 4 nodes in some “long” cycle of 
F 2 (where “long” is taken to be size 10 or more). The following lemma 
states that a locally optimal F 2 satisfies this condition, provided that Fi is 
chordless. 

Lemma 3. Let G = {V, E) he a simple cubic bipartite graph that has no 
potential 4-cycles, let Fi he a chordless 2-factor in G, and let F 2 he a 2-factor 
that is locally optimal with respect to Fi. Then for any cycle G in (y,Fi), 
there exists a cycle D in (14,^2) of size at least 10 such that \ V{G)nV{D)\ > 

4. 

Proof. Suppose F 2 is locally optimal with respect to Fi, and assume by 
contradiction that there is some cycle G in iV, Fi) such that (H, F 2 ) contains 
no cycle of size at least 10 that intersects G in at least 4 nodes. Let Tg = 
F 2 AE{G)-, we will show that {y,Ff) has fewer components than (H, T 2 ), 
contradicting the fact that F 2 is locally optimal with respect to Fi. 

Consider an arbitrary cycle D in {V,F 2 ) that intersects G. We will first 
show that any node v in D will be in a cycle in iVyf) that is at least as 
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large as D. This shows that the number of cycles of (y,F 2 ) is at most the 
number of cycles of (F, T 2 ). We then show that it is not possible that for 
every node, its cycle in (V, F!^) is the same size as the cycle containing it in 
{V,F2). 

If D contains exactly one edge, say e, in C, then in iV^Fl^), the edge e 
in D is replaced by an odd-length path. Hence, in this case the nodes in D 
will be contained in a cycle in {V^F!^) that is strictly larger than D. 

If D contains k > 1 edges in C, then D has size at most 8 , since otherwise 
D contradicts our assumption that no cycle exists in {V, F 2 ) of size at least 
10 that intersects C in at least 4 nodes. We now show this implies that D 
has size exactly 8 and k = 2: The size of D is either 6 or 8 since G has 
no potential 4-cycles. Note that D alternates edges in C and odd-length 
paths in (H, T 2 \ E{C)). Since C is chordless, the paths cannot have length 
1 and must thus have length at least 3. We thus have that D must consist 
of exactly two edges from C, say ei, 62 , separated by two paths of length 3, 
say Pi,P 2 . 

Since Pi and P 2 do not contain edges in C, {V, F 2 ) also contains the 
edges in Pi and P 2 . Hence, to show that all nodes in D are in cycles of size 
at least 8 in {V, P 2 ), it now suffices to show that the cycles containing Pi and 
P 2 in {V, F 2 ) have size at least 8 . Consider the cycle in (H, P 2 ) containing Pi; 
besides Pi, the cycle contains another path, say P(, connecting the endpoints 
of Pi, and this path must have odd length > 3 since G is bipartite and has 
no potential 4-cycles. Furthermore, P{ starts and ends with an edge in C, 
by definition of P 2 = F 2 AE{G). Note that P[ thus cannot have length 3, 
as this would imply that the middle edge in P[ is a chord for G. So P{ has 
length at least 5, and the cycle in (H, P 2 ) containing Pi thus has size at least 
8 . Similarly, the cycle containing P 2 in (VjF^) has size at least 8 . 

We have thus shown that all nodes in D are in cycles of size at least 
|H(P)| in (H,P 2 ), and hence, (H,P 2 ) has at most as many cycles as (H,P 2 ). 
Furthermore, it follows from the argument given above that the number of 
cycles in (H, P 2 ) and (H, P 2 ) is the same only if all nodes in G are in cycles of 
size 8 in both (H, P 2 ) and (H, P 2 ) and each such cycle consists of two edges 
from E{G) and two paths of length 3 in {V, P 2 \ E{G)). 

We now show by contradiction that the latter is impossible. Suppose G 
is such that both in (H, P 2 ) and in (H, P 2 ) every node in V{G) is contained 
in a cycle containing two edges from G. Then |I/(C')| must be a multiple 
of 4, say |H(C')| = Ak. Let the nodes of G be labeled 1, 2,... , 4/c = 0, such 
that {2i -t- 1,2z -|- 2} G P 2 and {2i, 2i -|- 1} G P 2 for i = 0,..., 2A: — 1. We 
also define a mapping p{i) for every i = 1 ,..., Ak, such that (H, P 2 \ E{C)) 
contains a path (which, by our assumption has length 3) from i to p{i) for 



i = 1,... ,4:k. Observe that by the definition of the mapping, p{p{i)) must 
be equal to i (mod Ak) for f = 1 ,..., Ak. 

Let p(l) = i, then the fact that edge {1, 2} is in a cycle with one other 
edge from C in (1^,-F 2 ) implies that either + 1} € F 2 or — 1} € F 2 , 
and that either p{2) = £ + 1 or p(2) = i — 1. In the first case, edge {2,3} 
must be in a cycle with {£ + !,£+ 2} in the second 2-factor (V, F 2 }, and thus 
p(3) = £ + 2. In the second case, {2, 3} must be in a cycle with jf' — 1, £ — 2} 
in (V, F 2 ), and thus p(3) = £ — 2. Repeating the argument shows that either 
p{i) = £ + {i — 1) (mod 4A:) for i = 1,..., 4A:, or p{i) = £ — {i — 1) (mod Ak) 
for f = 1 ,..., Ak. 

The first case gives a contradiction to the fact that G is bipartite: note 
that p{p{i)) = 2£ + i — 2 (mod Ak) and this must be equal to i. Hence, 
£ = \ (mod 2 k)] in other words, £ is odd, which cannot be the case since 
(H, F 2 \ E{C)) has a path of length 3 from 1 to £ (since p{l) = £.) and if £ 
were odd, then C would have a path of even length from 1 to £, and thus G 
would contain an odd cycle. 

Now suppose that p{i) = £ — [i — 1) (mod Ah) for z = 1,..., Ak. From 
the previous argument we know that £ must be even, since otherwise G is 
not bipartite. But then p(^/2) = £ — {£/2 — 1) = £I2 + 1. In other words, 
node £/2 is connected to node f/2 + 1 in (H, F 2 \ E{C)). But since the edge 
{£I2,£I2 + 1} is either in F 2 or in T 2 , {f'/2,£/2 -|- 1} is the only edge from 
G in its cycle in either (H, F 2 ) or (R, T}), contradicting the assumption on 
G. □ 

It may be the case that Lemma [3] also holds for cycles in (y,Fi) that 
do have chords, but we have not been able to prove this. Instead, there is 
a simple alternative operation that ensures that a cycle in [V,Fi) with a 
chord intersects at least one “long” cycle of size at least 10 in (H, F 2 ) in 4 or 
more nodes. The algorithm described next will add this operation, and for 
technical reasons it will only modify F 2 with respect to a cycle G in (y,Fi) 
if the cycle G does not yet intersect a long cycle in (H, F 2 ) in 4 or more 
nodes. 

2.2 A 2-Factor with Average Cycle Size 8 

We give our algorithm in Algorithm [T] The algorithm fixes a 2-factor Fi and 
initializes F 2 to be a 2-factor that contains all edges in E\Fi. The algorithm 
then proceeds to modify F 2 ] note that Fi is not changed. Figure [2] illustrates 
the modification to F 2 in the case of a chorded cycle Gi. 

We need to prove that the set of edges F 2 remains a 2-factor throughout 
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Let G = {V, E) be a bipartite cubic graph, with potential 4-cycles 
contracted using Lemma [TJ 

Let Fi be an arbitrary 2 -factor in G, and let Gi, ... ,Ck be the cycles 
in {V,F,). 

For each cycle C* in {V, Fi), let M{Ci) C E{Gi) be a perfect matching 
on V{Gi). 

Initialize F 2 = {E\Fi) U Uti M[Ci). 

while there exists a cycle Ci such that \V{Ci) H V[D)\ < 4 for all 
cycles D in {V,F 2 ) of size at least 10 do 
if Gi is a chordless cycle then 
I F2^F2AE{Gi). 
else 

Let {x,y} be a chord for Gi, let Pi,P 2 be the edge disjoint 
paths in Ci from x to y. 

Relabel Pi and P 2 if necessary so that Pi starts and ends with 
an edge in Pi \ P 2 . 

F2^{F2AE{Pi))\{x,y}. 

end if 
end while 

Uncontract the 4-cycles in (V,Fj) for j = 1,2 using Lemma [TJ 
Return the 2-factor among Pi, P 2 with the smaller number of 
components. 

Algorithm 1: Approximation Algorithm for Cubic Bipartite TSP 


the course of the algorithm, that the algorithm terminates, and that upon 
termination, either (U, Pi) or (U, P 2 ) has at most |U |/8 components. The 
latter is clear: if the algorithm terminates, then the condition of Lemma [2] 
is satisfied, and therefore one of the two 2 -factors has at most |U |/8 compo¬ 
nents. 

To show that P 2 is a 2-factor and that the algorithm terminates is a little 
more subtle. In order to show this, it will be helpful to know that each cycle 
in {V, Fi) alternates edges in Pj n Pj+i and edges in Fi \ Pj+i for i = 1,2 
(where subscripts are modulo 2, so P 3 = Pi). This is true initially, however, 
it is not the case that this property continues to hold for all cycles. We will 
show that it does hold in certain cases, which turn out to be exactly the 
cases “when we need it”. In the following, we will say a cycle or path in 
iy,Fi) is alternating (for Pi and P 2 ) if it alternates edges in Fi n Pj+i and 
Fi \ Pj+i. We will say that a cycle C in (U, Pi) is violated if there exists no 
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Figure 2: The figure on the left shows a chorded cycle C in Fi of size 10 
and all edges in G that have both endpoints in C. The dashed edges are 
in Fi \ © 2 ) and non-dashed edges are in F 2 (where not all edges in F 2 that 
are incident on the nodes are shown). The figure on the right shows how 
Algorithm [T] would update F 2 . 


D of size at least 10 in (©,T 2 ) such that \V{C) n V{D)\ > 4. 

Lemma 4. Algorithm [I] maintains that F 2 is a 2-factor that satisfies the 
following properties: 

(1) if C in {V,Fi) is violated, then C is alternating for Fi and F 2 ; 

(2) if D in (F, © 2 ) is not alternating for Fi and F 2 , then D has size at 
least 10. 

Proof. We prove the lemma by induction on the algorithm. Initially, F 2 con¬ 
sists of E\Fi and U!Li M{Ci), which are two edge-disjoint perfect matchings 
on V. Hence, F 2 is a 2-factor, and the two properties hold for all cycles in 
(F,Fi) and (^,^ 2 ). 

Suppose the lemma holds and we modify F 2 by considering some violated 
cycle C. The two properties of the lemma imply the following: 

Claim 1. If C is violated, then (F, F 2 \E{C)) consists of even cycles and 
odd-length paths, where paths that are not alternating for Ei and E 2 have 
length at least 9. 

Proof of Claim: For each path in the graph (F, F 2 \E{C)) there exists some 
cycle D in (F, © 2 ) such that the path results when removing E[C) fl E{D) 
from D. If D is alternating for Pi and P 2 , then the path must have the 
same property, and it must start and end with an edge in F 2 \ Fi. Hence, 
the path must have odd length if D is alternating. If D is not alternating 
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then D has size at least 10 by Property (2), so C can have at most one 
edge in common with D, since otherwise C is not violated. Hence, the path 
obtained by removing the unique edge in E{C) fl E{D) has length at least 

9, and its length must be odd, since D is an even cycle. o 

If C is chordless, then we modify E 2 to E 2 = F 2 AE{C). Clearly, F 2 is 
again a 2-factor, and Property (1) remains satished. Furthermore, any cycle 
in (P, Fg) that is not alternating for Fi and F 2 either also existed in (P, F 2 ) 
and hence it has size at least 10, since Property (2) holds for F 2 , or the cycle 
contains a path in (P, F 2 \ E{C)) that is not alternating for Fi and F 2 , and 
this path has length at least 9 by the claim. So Property (2) holds for 

Now consider the modihcation of F 2 when considering a chorded cycle 
C in {V,Fi). Let Pi be as defined in the algorithm. First consider F 2 = 
F 2 AE{Pi)-, every node is incident to two edges in except for x and y, 
which are incident to three edges in F 2 , namely two edges in E{C) plus the 
edge {x,y}. Hence, removing {x,y} will give a new 2-factor, say F 2 ■ The 
modification from F 2 to F 2 is exactly the modihcation made to P 2 by the 
algorithm. 

We now show that the two properties are satished. Clearly, C is not 
alternating for Pi and P^^, so in order to maintain Property (1), we need 
to show that C is no longer violated. To do this, we show that {V^F^) 
contains a cycle of size at least 10 that contains x, y and their 4 neighbors 
in C. First, suppose by contradiction that after removing {x,y}, x and y 
are not in the same cycle. Consider the component of (P, F 2 ) containing x: 
starting from x, it alternates edges in E(C) and paths in P 2 \P(C'), starting 
and ending with an edge in E{C). By Claim [T] the paths in (P, P 2 \ E{C)) 
have odd length, and hence the component containing x must be an odd 
cycle, contradicting the fact that G is bipartite. So, x and y must be in the 
same cycle in iV^Fll). This cycle must thus consist of two odd-length paths 
from X to y, each starting and ending with an edge in F 2 flPi. These paths 
cannot have length 3, because this would imply that the path plus the edge 
{x, y} would form a potential 4-cycle. Hence, the cycle in (P, F 2 ) containing 
X and y has size at least 10. 

For Property (2), note that any cycle D in (P, F 2 ) that is not alternating 
for Pi and F 2 either (i) existed in (P, P 2 ) and therefore has size at least 10, 
or (ii) contains x and y and we showed above that this cycle has size at least 

10, or (iii) it contains a path in (P, P 2 \ E{C)) that is not alternating for Pi 

and P 2 , and by the claim this path has length at least 9. Hence, Property 
(2) is satished by F!^. □ 
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Lemma 5. Given a cubic bipartite graph G = (y,E), Algorithm[l\ returns 
a 2-factor in G with at most \V\/^ components. 

Proof. By Lemma [T] it suffices to show that the current lemma holds if G 
has no potential 4-cycles. By the termination condition of Algorithm [T] and 
Lemma [21 the 2-factor returned by the algorithm does indeed have at most 
|y|/8 components, so it remains to show that the algorithm always returns 
a 2-factor. 

By Lemma m the algorithm maintains two 2-factors Fi and F 2 . Observe 
that if a cycle G is not violated, then this continues to hold throughout the 
remainder of the algorithm: Let D be a cycle of size at least 10 in {V,F 2 ) 
such that \V{G)r\V{D)\ > 4. The only possible changes to D will be caused 
by a violated cycle G', which necessarily contains at most one edge in D: by 
Lemma 0] G' is alternating for Fi and F 2 , so if G' contains more than one 
edge in D, G' cannot be violated. The modification of F 2 with respect to 
F{G') can therefore only cause the cycle D to become a larger cycle D' where 
y{D') 2 V{D). So D' will have size at least 10, and \V{G) D V{D')\ > 4. 

It remains to show that if we modify F 2 with respect to some violated 
cycle G, then G is not violated for the new 2-factor F^. If C is not chordless, 
then this holds because G is not alternating for Fi and the new 2-factor F 2 , 
so by Lemma m G is not violated. If G is chordless and violated, then by 
Claim [H {V,F 2 \ F{G)) consists of even cycles and odd-length paths. The 
proof of Lemma [3| then shows that taking the symmetric difference of F 2 
with F{G) (strictly) reduces the number of components. This implies that 
for F 2 = F 2 AF{G), cycle G is not violated: otherwise, we could apply the 
same arguments to show that {V, F 2 AF{G)AF{G)) has strictly fewer cycles 
than (I/, T 2 ), but this is a contradiction since F 2 AF{G)AF{G) = F 2 . □ 

In the appendix, we give an example on 48 nodes that shows our analysis 
of Algorithm [1] is tight. In fact, the example is also tight for the local im¬ 
provement heuristic from Section [2.II and for the local improvement heuristic 
we obtain if we allow Algorithm [1] to modify F 2 for cycles G that are chorded 
and/or do have at least two edges in a cycle of size 10 or more in (I/, T 2 ). 

3 Cubic graphs 

We now consider cubic graphs, in other words, we drop the requirement 
that the graph is bipartite. We assume the graph is 2-connected. The best 
known approximation result for graph-TSP on a 2-connected cubic graphs 
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G = {V, E) is due to Correa, Larre and Soto [ 6 ] who show how to find in 
polynomial time a tour of length at most (| — gllse) 1 ^ 1 - 

One obstacle for their techniques are chorded 4-cycles, i.e., a set of 4 
nodes (ui, U 2 , ua, ^ 4 ) such that the subgraph of G induced by {ui, r; 2 , U 3 , U 4 } 
contains edges {vi,V2}, {v2,V3}, {^ 3 ,^ 4 }, {u 4 , ui} and the “chord” {^ 2 ,^ 4 }. 
In fact, Correa et al. prove the following. 

Lemma 6 (Correa, Larre and Soto [ 6 ]). Consider a graph-TSP instance on 
a 2-connected cubic graph G = {V,E), and let B be the set of nodes in G 
contained in a chorded f-cycle. Then, we can find in polynomial time a tour 
of length at most ^\B\ + (| — g^)(|^\-S|) -|- 2. 

The proof of this lemma is contained in the proof of Theorem 2 in [ 6 ]. 
More precisely, it is shown that there exists a distribution over tours of 
expected size Ylvev where z{v) is the “average contribution” of 

V. In the proof of the Theorem 2 in [ 6 ] it is shown that Ylvev — 

On the other hand, chorded 4-cycles are “beneficial” for the analysis 
of the Momke-Svensson m algorithm for the graph-TSP on 2-connected 
subcubic graphs, as we will show next. 

Lemma 7. Consider a graph-TSP instance on a 2-connected subcubic graph 
G = {V, E), and let B be the set of nodes in G contained in a chorded f-cycle. 
Then, we can find in polynomial time a tour of length at most ||I4| — — |. 

Proof. For convenience, we will use the word “tour” to refer to a connected 
spanning Eulerian multigraph obtained by doubling and deleting some edges 
of G. Note that the graph-TSP instance on G indeed has a tour (obtained 
by shortcutting a Eulerian walk on this multigraph) of length at most the 
number of edges in the multigraph. Momke and Svensson m show that 
there exists a probability distribution over tours, such that each edge e G E 
appears an even number of times (zero or twice) with probability and the 
expected number of edges in the tour is ||E| — |. 

Now, we can simply contract chorded 4-cycles in a cubic graph to obtain 
a subcubic graph G, on which we can apply the Momke-Svensson algorithm 
to hnd a distribution over tours with |(|E| — f |E|) — | = ||I/| — |il| — | 
edges in expectation. Next, we uncontract the chorded 4-cycles in each of 
the tours T: Eor a chorded 4-cycle (ui,U 2 ,U 3 ,U 4 ) that is contracted to a 
node V, note that T contains an even number of edges incident to v. Hence, 
either both edges incident to u in G appear an odd number of times in T, 
or both appear an even number of times. In the first case, we uncontract 
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the chorded 4-cycle by adding the edges {vi,V 2 },{v 2 -,Vi] and {t;3,t;4}. In 
the second case, we uncontract the chorded 4-cycle by adding the edges 
{vi,V2}, {V2,V3}, {U 3 , U4}, {V 4 , Ui}. 

Since each edge appears an even number of times with probability 
the second case happens with probability and the first case thus happens 
with probability Hence, the expected number of edges in T increases by 
giving a total number of edges of||I4| — — |in expectation. □ 

Note that the bound in Lemma[6]is increasing in \B\ and the bound in 
Lemma [7] is decreasing in \B\. Setting the bound in Lemma E] equal to the 
bound in Lemma [7] gives \B\ = and thus shows that there exists a 

polynomial time algorithm for finding a tour of length at most (| — g^) \V\ 
for a graph-TSP instance on a 2-connected cubic graph G = (H, E). By an 
observation of Momke and Svensson P, this also implies a (| — g^)- 
approximation algorithm for cubic graph-TSP, i.e., the graph G does not 
have to be 2-connected. We have thus shown the following result. 

Theorem 2. There exists a -approximation algorithm for Graph- 

TSP on cubic graphs. 
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4 Tightness of the Analysis of Algorithm [1] 

We give an example of a cubic bipartite graph G = (U, E) for which both the 

2-factors Fi and F 2 that result from Algorithm [1] have |U|/8 components. 

The instance has 48 nodes, numbered 1 through 48, and (U, Fi) contains 

six cycles, four cycles of size 6, one cycle of size 10 and one cycle of size 14. 
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For brevity, we denote the cycles by only giving an ordered listing of their 
nodes; an edge between consecutive nodes and between the last and first 
node is implicit. The cycles in {V,Fi) are: 

Cl = (1,2, 3,4, 5,6), 

Ca = (7,8,9,10,11,12), 

Cs = (13,14,15,16,17,18), 

Cl = (19,20,21,22,23,24), 

C 5 = (25, 26, 27, 28, 29, 30, 31,32, 33, 34), 

Ce = (35, 36, 37, 38, 39,40,41,42,43,44,45,46,47,48). 

The second 2-factor (14, F 2 ) has six cycles as well, namely, three cycles of 
size 6 and three cycles of size 10. We again denote the cycles by giving an 
ordered listing of the nodes, but now a semicolon between subsequent nodes 
indicates that the nodes are connected by an edge in Fa \ Fi, and a comma 
denotes that they are connected by an edge in Fa flFi. The cycles in (14, Fa) 
are 


Di = (6,1; 30, 31; 42,43; 28, 29; 40,41;), Di = (18,13; 38, 39; 44,45;), 

F>a = (48, 35; 10,11; 4,5; 8,9; 2, 3;), 05 = (24,19; 26, 27; 32, 33;), 

D 3 = (34, 25; 16,17; 22, 23; 14,15; 20, 21;), Dq = (12, 7; 36,37; 46,47;). 

It is straightforward to verify that every node occurs in exactly one cycle 
in (14,Fi) and exactly one cycle in (14,Fa), and that each cycle in (14,Fi) 
alternates edges in Fi\Fa and edges in FiflFa, and that each cycle in (14, Fa) 
alternates edges in Fa flFi and edges in Fa \Fi. Furthermore, each cycle Ci 
in (14, Fi) has exactly two edges in a cycle D in (14, Fa) of size exactly 10. 

Local Optimum 

Figure [3] depicts each of the six cycles Ci in (14, Fi), together with the cycles 
Dj in (14, Fa) that intersect the given cycle Ci. For any of the cycles Ci, 
replacing Fa by F 2 AE{Ci) does not decrease the number of components of 
(14, Fa) for any cycle Ci, nor does the modihcation of Fa for a chorded cycle 
described in Algorithm [TJ 

Hence, if, rather than following Algorithm [H we would execute one of 
the two possible modifications of Fa suggested by the algorithm, as long as 
this reduced the number of components of (14, Fa), then the 2-factor Fa is in 
fact a local optimum with respect to this process since none of the possible 
modifications reduces the number of components. 
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We note that the instance does have a Hamilton cycle. This cycle con¬ 
tains subpaths of more than two adjacent edges in E \ Fi, so it can never 
be found from Fi and F 2 using the “moves” we defined, even if we allow 
“moves” that do not reduce the number of components of (H, T 2 ). We give 
the Hamilton cycle by again giving an ordered listing of the nodes, where a 
semicolon between subsequent nodes indicates that the nodes are connected 
by an edge in E\Fi, and a comma denotes that they are connected by an 
edge in Fi. 


(1;30,29;40,39, 38;13,14,15,16; 25, 26; 19,20, 21; 34,33; 24, 23,22; 17,18; 45, 
44,43; 28, 27; 32, 31; 42,41; 6, 5; 8, 7,12,11; 4, 3; 48,47,46; 37, 36,35; 10, 9; 2,1). 
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Figure 3: Depicted are six figures, one for each cycle Ci in {V, Fi) for i = 
1,... ,6. The white nodes are the nodes in the cycle Ci, and the dashed 
edges are the edges in Fi \ E{Ci). The non-dashed and thick dashed edges 
are edges in F 2 . For each Ci, the number of components of {V, F 2 ) does not 
decrease by replacing F 2 by F 2 AE{Ci). 


19 






























































